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ABSTRACT 


'This  paper  examines  statistical  inference  for  P(Y  <  X),  where  X  and 
Y  are  independent  normal  variates  with  unknown  means  and  variances.  The  case 
of  unequal  variances  is  stressed.  X  can  be  interpreted  as  the  strength  of  a 
component  subjected  to  a  stress  Y,  and  P(Y  <  X)  is  the  components 
reliability.  For  point  estimation,  a  predictive  estimator  which  can  be 
calculated  from  the  Behrena-Fisher  distribution  is  derived  and  compared  with 
the  maximum  likelihood  and  uniformly  minimum  variance  unbiased  estimators 
through  a  simulation  study.  Two  approximate  methods  for  obtaining  confidence 
intervals  and  an  approximate  Bayesian  probability  interval  is  obtained.  The 
actual  coverage  probabilities  of  these  intervals  is  examined  by  simulation. 
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SIGNIFICANCE  AND  EXPLANATION 


The  reliability,  say  R,  of  a  system  is  often  of  interest,  where 
R  =  P(Y  <  X).  Here,  the  random  variables  X  and  Y  are  assumed  to  be 
independent,  with  X  the  strength  of  a  component  of  interest  subjected  to  a 
stress  Y.  Our  results  are  for  the  case  where  X  and  Y  are  normally 
distributed  with  unknown  means  and  unknown  variances.  (This  of  course  means 
that  after  transformation  to  logs,  that  our  results  apply  for  the  case  where 
X  and  Y  are  distributed  as  log  normal).  For  the  case  of  point  estimation 
of  R,  we  obtain  a  new  estimator  based  on  the  predictive  distribution  of 
Y  -  X.  This  is  compared,  via  a  simulation  study,  with  2  estimators  mentioned 
in  the  literature  without  too  much  analysis,  namely  the  "nearly"  maximum 
likelihood  estimate  and  the  Rao-Blackwell  UMVUE.  The  simulation  study 
supplies  evidence  for  the  conventional  wisdom  that  the  use  of  the  "nearly" 
maximum  likelihood  esimtate  is  well  advised. 

In  addition,  we  obtain  for  the  first  time  two  approximate  methods  for 
constructing  confidence  intervals  for  R,  as  well  as  an  approximate  Bayesian 
probability  interval.  The  actual  coverage  probabilities  of  these  intervals  is 
examined  by  simulation. 
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The  responeibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  authors  of  this  report. 


STATISTICAL  INFERENCE  FOR  RELIABILITY  FROM  STRESS 
STRENGTH  RELATIONSHIPS:  THE  NORMAL  CASE 

Benjamin  Reiser*  and  Irwin  Guttman 

1.  INTRODUCTION » 

Let  X  and  Y  be  independent  normal  random  variables  with  means  and 
2  2 

variances  ii  ,a  and  M  ,o  respectively.  Interest  is  focused  on 

xx  y  y 

statistical  inference  for  the  parameter 


where  *  is  the  standard  normal  cumulative  distribution  function.  If  X  is 
interpreted  as  the  strength  of  a  component  subjected  to  a  stress  Y,  the 
resulting  reliability  is  given  by  R.  This  type  of  situation  is  of  particular 
interest  in  probabilistic  mechanical  design  (for  example,  see  Haugen 
(1980)).  He  further  assume  that,  random  samples  of  size  n  and  m,  say 
X  -  (X.,...,X  )•  and  Y  »  ( Y  , . • . ,Y  )'  are  available  on  X  and  Y.  In 

a*  i  n  a»  i  u 

this  situation  point  and  Interval  estimation  procedures  are  discussed. 

Related  problems  have  been  widely  presented  in  the  literature.  Bimbaum 

(1956),  Bimbaum  and  McCarty  (1958)  and  Owen  et  al.  (1964)  present  non- 

parametric  confidence  limits  for  this  problem.  In  addition  Owen  et  al.  (1964) 

discuss  the  normal  case  for  m  ■  n  and  cr  -  a  and  for  paired  observations 

x  y 

on  the  jointly  normal  variates  X  and  Y.  Enis  and  Geisser  (1971)  look  at 
various  situations  from  a  Bayesian  viewpoint  giving  predictive  estimates  and 
posterior  Bayesian  limits  but  do  not  discuss  the  case  where  a*  is  not 
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necessarily  equal  to  0^.  Procedures  if  the  parameters  0y  are  known  are 

given  in  Church  and  Harris  (1970)  and  Kazumdar  (1970). 

For  the  particular  problem  we  have  defined,  Lloyd  (1980)  and  Haugen 
(1980)  use  the  (almost)  maximum  likelihood  estimate 


R1  “♦ 


X  -  Y 


/7 7 


(i.i) 


n  m 

where  X  -  )  X  /n,  Y  -  \  Y  /m,  (n  -  1)S*  -  \  (X  -  X)",  and 

i-1  1  i-1  1  i-1 

2  ®  -  2 

(m  -  1)s  *  \  (Y  -  Y;  while  Downton  (1973)  derives  the  uniformly  minimum 

y  i-1  1 

variance  unbiased  estimator  (UMVUE)  of  R,  which  we  will  denote  by  R^ .  (For 
completeness,  we  state  the  formula  for  in  Appendix  AI.)  Lloyd  obtains, 

by  the  usual  propagation  of  errors  method,  an  estimate  of  the  variance  of 

A  A 

R1 ,  which  with  the  assumption  that  R1  is  normally  distributed,  then  gives 

approximate  confidence  limits.  Since  0  <  R1  <  1  with  R1  usually  close  to 

one  this  procedure  can  give  misleading  limits.  Haugen  (1980)  obtains 

confidence  limits  of  y  -  y  based  on  the  standard  approximate  t  solution 

x  y 

to  the  Behrens-Fisher  problem.  The  limits  for  y  -  y  are  simply  divided  by 

x  y 


/7T-2 


S  and  in  turn  ♦  of  these  values  are  calculated  providing  limits 
x  y 

on  R.  The  properties  of  this  procedure  are  not  well  understood  due  to  the 

2  2 

S+S,  without  accounting  for  the  variation  of  a. 
x  y 

Kececioglu  and  Lamarre  (1978)  attempt  to  generalize  the  standard  procedure  for 
deriving  confidence  limits  for  the  tail  probability  of  one  normal  distribution 
to  this  case.  Unfortunately  their  derivation  involves  a  fundamental  error 
which  will  be  discussed  in  more  detail  below. 

In  Section  2  of  this  paper,  we  derive  a  point  estimator  of  R  using  a 
Bayesian  predictive  approach,  whose  characteristics  are  cceipared  with  those  of 

A  A 

R^  and  Rj  in  Section  3,  by  means  of  a  simulation  study.  In  Section  4,  we 


-2 


derive  confidence  limits  on  R,  first  from  the  frequentist  approach  involving 

a  non-central  t  distribution,  obtained  by  approximating  the  distribution  of 

a  weighted  sum  of  independent  Chi-squares  by  the  distribution  of  a  scaled 
2 

X  ~  variable  by  equating  the  first  two  momenta .  A  well  known  approximation 
to  the  non-central  t  is  then  utilized  to  drive  a  different  confidence 
interval.  Finally,  we  use  a  Bayesian  approach  to  derive  a  (posterior) 
confidence  interval  for  R.  These  confidence  intervals  are  compared  in 
Section  4.4  by  means  of  a  further  simulation  study. 


2.  PREDICTIVE  APPROACH 

There  is  another  approach  which  will  generate  a  point  estimate  of  the 

reliability  functions,  namely  the  Bayesian  approach  through  the  relevant 

predictive  distribution.  So  suppose  in  general,  that  the  random  variable 

w,  whose  distribution  is  f(w|8),  with  9  a  (t  *  1)  vector  of  parameters, 

is  to  be  observed  independently  of  the  data  at  hand,  say  Wj,...,wn,  where 

the  w^  are  independently  and  identically  distributed  observations  on  w. 

Then  the  predictive  distribution  of  w,  say  h,  given  the  data 

w  ■  (w4,...w  )',  is  defined  as 
~  i  n 

h(w| w)  -  J  h(w|0)p(8 1  w)d0  ,  (2.1) 

0  ~  ~  ~  ~ 

where  p(8lw)  is  the  posterior  distribution  of  9,  given  w,  that  is, 

'  (2.1a) 

i-1  1 

with  p(8)  the  prior  of  0. 


-3- 


Now  we  are  interested  in  the  reliability  R,  where 

R  -  P(Y  <  x|u  ,y  ,o2,o2)  (2.2) 

y  x  x  y 

and  suppose  we  have  available  the  data  x  =  (x1,...,xn)’  and  jg*  y1 , . . . , v„) ' , 

which  are,  respectively  n  and  ra  independent  observations  on  X,  the 

strength  variable,  and  Y,  the  stress  vairable.  If  subsequent  to  the  taking 

of  x  and  £,  we  will  observe,  independently,  X  and  Y,  where  X  has  the 

same  common  distribution  of  the  x^  and  y  has  the  same  common  distribution 

2  2 

of  the  yj ,  then,  as  is  well  known,  if  x^^  ~  N(wx,Ox>,  yj  ~  N(uy,oy),  the 
application  of  (2.1)  when  the  prior  distribution  chosen  is  the  Jeffrey’s 
diffuse  prior 

22  22-1 

P(y  ,ii  ,0  ,0  )  «  (o  o  )  (2.3) 

x  y  x  y  x  y 


leads  to  independent  predictive  distributions  h  and  h  for  X  and  Y 


which  are  such  that 


and 


x  ~  x  +  / (i  +  -)s2  t  . 

n'  x  n-1 

Y  ~  y  +  / ( 1  +  -)s2  t  , 
v  nr  y  ra-1 


(2.3a) 


(2.3b) 


where  tR_^,  tm  ^  are  independent  random  variables  with  student  t- 
distributions  of  n-1,  m-1  degrees  of  freedom  respectively.  He  note 

that  in  this  particular  case,  the  above  predictive  distributions  also  have  a 
confidence  interpretation. 

He  now  consider  the  predictive  estimate  of  R  defined  in  (2.2),  namely 


R  *  P(Y  <  X|X,X) 


(2.4) 


From  (2.3)  above,  we  have  that  (2.4)  may  be  written  as 


(2.5) 


R,  ■  p(/ (l  +  “)s2  t  -  /  (l  +  ~)s2  t  .  <  x  -  y|x,y) 
3  ^  '  ■'  y  b-1  1  n'  x  n-1 


p( ( cos  0)t  -  (sin  0)t  < 

'  m- 1  n-1 


x  - 


/(i  ♦  — )s2  +  (i  +  -)s2 

'  tn '  \w  '  n'  v 


nr  y 


n'  x 


p(bfT(«  -  1,n  -  1)  <  —  ~  _ ) 

/  ( 1  +  -)s2  +  (i  +  -)s2 

v  n'  x 


m'  y 


where  BFT  is  the  Behrens-Fisher  t  variable,  degrees  of  freedom 
( m  -  1 , n  -  1 ) ,  and  0  given  by 


0  “  tan 


/(,  .  v 

-1  v _ ny  x 

/  1  ♦  i  s2 

m  y 


(2.5a) 


For  a  given  set  of  data,  tables  of  the  Behrens-Fisher  t  are  available 
for  various  values  of  the  pairs  of  degrees  of  freedom,  and  'angle'  0.  These 
prove  to  be  not  very  helpful  in  practice  -  only  percentage  points  are 
available,  and  this  coupled  with  the  fact  that  the  calculated  value  of  0 
(with  probability  one)  is  never  a  value  used  in  the  table,  requires  reverse 
double  extrapolation,  etc.  There  is,  however,  an  approximation  due  to  Patil 
(1965),  who  sets 

BFT(ra  -  1 ,n  -  1)  *  tf/h  (2.6) 

where  tf  is  a  Student-t  variable  with  f  degrees  of  freedom,  with 

rra  -  1  _ 2 .  .  n  -  1  _,_2„i2 


f  -  4  + 


rm  -  i  z.  .n-i  . 

008  9  +^nrrsin  0J 


[- 


&-Z-V2—  cos40  +  —to  ---Li2. 


(2.6a) 


(m  -  3)  (m  -  5) 


(n  -  3)  (n  -  5) 


sin40] 


and 


f  rm  - 

f  -  2  U  - 


m  -  1  2,  .  n  -  1  .  2.i-1 

—  cos  0  +  - -  sin  0 

3  n  -  3  3 


(2.6b) 


r 


This  approximation  seems  to  work  well  for  (n,m)  as  small  as  8,  and  in  our 

A 

simulations  below  we  use  (2.6)  above  in  evaluating  of  (2.5).  The 

A 

behaviour  of  the  R^ ' s  is  now  discussed  in  the  following  sections. 

A 

3 .  SIMULATION  STUDY  OF  THE  BEHAVIOUR  OF  THE  Ri . 

For  various  sample  sizes  (n,m),  we  generated  random  samples  on  X 
and  Y  for  the  case  where  R  defined  in  (2.2)  takes  on  the  values  .99,  .90 
and  .67.  For  each  of  the  chosen  (n,m),  we  iterated  5,000  times,  and  from 
the  results  computed  the  bias  and  the  root  mean  square  error  (RMSE),  for  the 

A 

R^'s.  The  results  are  given  in  Table  1. 

A 

The  simulation  results  confirm  that  the  UMVUE  estimator  R2  is  indeed 

A 

unbiased.  In  terms  of  bias,  the  predictive  estimator  is,  in  all  our 

A  A  A 

calculations,  worse  than  either  R1  or  R^  The  bias  of  R1  seems  not  large 
(discrepancies  are  in  the  third  place). 

A 

Referring  to  RMSE,  none  of  the  R^  turn  out  to  be  best  uniformly.  The 
predictive  is  best  only  for  the  case  R  ■  .67,  and  is  otherwise  the  worst.  We 

A  A 

note  that  for  R  ■  .67,  that  R^  is  second  best  with  R2  trailing  the  other 
two,  but  that  differences  are  small. 

A 

For  the  cases  R  »  .99  or  .90,  R3  in  terms  of  RMSE,  is  always  ranked 

A  a 

last,  with  R1  and  Rj  alternating  as  the  best,  but  again  with  only  small 

A  A  A 

differences.  However,  the  difference  of  R^  or  P2  with  R^  could  be 
substantial.  For  example,  for  R  *  .99,  (n,m)  =  (10,10),  the  RMSE  of  R1 

A  A 

and  Rj  are  .0173  and  .0146  respectively,  while  the  RMSE  of  R3  is  .0266. 

We  note  that  as  the  sample  sizes  (n,m)  increase,  the  behaviour  of  the 


R^  tend  to  be  similar,  and  this  is  repeated  as  R  tends  to  .5.  As  a 


R 

n 

m 

BIAS 

RMSE 

BIAS 

RMSE 

BIAS 

RMSE 

10 

10 

-.0047 

.0173 

.0000 

.0146 

-.0161 

.0266 

10 

15 

-.0038 

.0148 

.0001 

.0126 

-.0129 

.0221 

10 

20 

-.0037 

.0144 

-.0040 

.0121 

-.0119 

.0209 

15 

10 

-.0038 

.0148 

.0001 

.0126 

-.0129 

.0221 

.99 

15 

15 

-.0031 

.0131 

-.0001 

.0114 

-.0100 

.0183 

15 

20 

-.0028 

.0120 

-.0041 

.0104 

-.0086 

.0164 

20 

10 

-.0033 

.0140 

.0003 

.0121 

-.0115 

.0204 

20 

15 

-.0028 

.0120 

.0000 

.0104 

-.0087 

.0164 

20 

20 

-.0023 

.0107 

-.0037 

.0091 

-.0071 

.0141 

10 

10 

-.0053 

.0662 

.0019 

.0673 

-.0240 

.0700 

10 

15 

-.0052 

.0605 

.0007 

.0613 

-.0207 

.0636 

10 

20 

-.0055 

.0572 

-.0009 

.0576 

-.0195 

.0601 

15 

10 

-.0052 

.0605 

.0007 

.0613 

-.0207 

.0636 

.90 

15 

15 

-.0036 

.0533 

.0011 

.0539 

-.0161 

.0556 

15 

20 

-.0049 

.0505 

-.0025 

.0510 

-.0159 

.0527 

20 

10 

-.0040 

.0565 

.0018 

.0567 

-.0180 

.0590 

20 

15 

-.0052 

.0513 

.0002 

.0511 

-.0162 

.0535 

20 

20 

-.0037 

.0471 

-.0009 

.0476 

-.0131 

.0488 

10 

10 

-.0039 

.1161 

-.0016 

.1181 

-.0135 

.1105 

10 

15 

-.0015 

.1071 

.0003 

.1085 

-.0096 

.1025 

10 

20 

-.0013 

.1007 

.0000 

.1018 

-.0085 

.0969 

15 

10 

-.0015 

.1071 

.0003 

.1085 

-.0096 

.1026 

.67 

15 

15 

-.0011 

.0946 

.0003 

.0957 

-.0078 

.0914 

15 

20 

-.0006 

.0879 

-.0014 

.0882 

-.0064 

.0853 

20 

10 

-.0028 

.1026 

-.0003 

.1044 

-.0099 

.0989 

20 

15 

.0006 

.0879 

.0033 

.0903 

-.0053 

.0852 

20 

20 

-.0023 

.0833 

-.0034 

.0855 

-.0073 

.0812 

[  Table  1.  Comparisons  of  the  R^  based  on  5,000  iterations. 

A  A  * 

R,,  see  (1.2 )»  R,,  see  (AI.3)»  R_ ,  see  (2.5) 

fc  1  2  3 

It 

* 

i 


r 


consequence  of  this  and  the  results  above,  and  because  of  the  simplicity  in 
computing  R  ,  we  favour  its  use  as  a  point  estimator. 

We  also  note  that  for  small  samples,  the  RMSE  is  quite  large,  so  that 
point  estimates  here  are  not  very  satisfactory.  It  is  for  this  reason  that  w<> 
now  consider  the  question  of  interval  estimation. 


4.  CONFIDENCE  LIMITS  ON  R. 

4.1  The  Frequent! st  Approach 

Usually  a  lower  bound  on  the  reliability  is  of  interest  and  we  will 
restrict  ourselves  to  this  case.  Since  4  is  a  monotonically  increasing 


//  2  2 

function  of  6  -  (y  -  u  )//  o  +  a  finding  a  lower  confidence  bound  for 
x  y  /  x  y 

R  is  equivalent  to  finding  one  for  5.  Our  problem  has  essentially  the  same 

structure  as  the  Behrens-Fisher  problem  and  the  solution  presented  below  is 

similar  in  spirit  to  the  standard  approximate  "t"  solution  to  the  Behrens- 

Fisher  problem.  2  2 

a *  o * 

Note  that  X  -  Y  ~  n(  u  -  u  ,  —  +  — *■) , 

x  y  n  n 


and 


2  2  2 

(n  -  1)S  /o  ~  x  , 
x  x  An-1 


(m 


2  2 
1)S  /a 

y  y 


2 

Vi  ' 


with  all  three  of  the  above  being  pairwise  independent.  Let  N 
then 


or  N(/N  6,1)  . 


(4.1) 


In  the  equal  sample  alee  case,  N  *  n  ■  ».  In  addition 


2  2 

2  2  °x  2  "v  2 

Sx  y  n  -  1  Xn-1  m  -  1  xm-1 


or  approximately 


2  .  2 
o+c  . 
ax  v  2 

f  xf 


(4.2) 


(The  notation  a  stands  for  "approximately  distributed  as")  where 


.  ,  2  2,2/  *  °» 

'•x  y ;  /  n  -  1  m  -  1 


and  s2  +  S2  is  distributed  independently  of  (4.1).  Defining  6  ■  — - - — 

x  y  /  2 

/  S  +  S 
x 


we  obtain  from  (4.1)  and  (4.2)  that 

A 

✓5  «  a  t,{/5  6)  (4.3) 

where  tf (/n  6)  denotes  a  non-central  t  distributed  variate  with  f 
degrees  of  freedom  and  non-centrality  parameter  /v  6 .  if  n  *  m  and  it  is 
known  that  a*  «  o^,  (4.3)  holds  exactly  and  leads  to  the  well  known  sampling 

theory  solution  (see  Owen  et  al.  (1964),  Enis  and  Geisser  (1971)).  By  using 

2  2  * 

S  and  S  in  the  formula  for  N  and  f  the  estimates  N  and  f  are 
x  y 

obtained  giving  the  further  approximation 


/"n  6  ~  t*(/H  6) 


(4.4) 


For  equal  sample  sises  N  »  n  *  m  does  not  require  estimation.  Since  the 
non-central  t  distribution  has  the  monotone  likelihood  ratio  property  an 
approximate  1  -  o  level  lower  confidence  bound  for  6  can  be  obtained  by 


solving 


Prob( 


t2( /h  S)  <  /"n  6)  -  1  - 


(4.5) 


numerically  for  6  (Lehmann  (1958)).  Denoting  this  solution  by  6^,  the 
l-o  level  approximate  lower  confidence  bound  for  R  is  then  simply 
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*<  M 


Rl£  "  Kececiogle  and  Lamarre  (1978)  take  a  similar  approach  but  make 

the  mistaken  assumption  that  N  •*  f  +  1 . 

A  simplification  is  possible  by  using  the  well  known  approximation 

2  1/2 

<tf(Y)  -  y )/( 1  +  ^r~)  ~  N(0,1)  . 

Applying  this  to  (4.3)  gives 


rr  .  .2  1/2 

/  N  (<*  -  6)/(l  +  “-)  ~  N(0,1)  . 


2f 


(4.6) 


Thus  a  (1  -  a)  level  approximate  lower  confidence  bound  6 ^  can  be 
obtained  as 


21 


«  -  £  ♦  *r) 

N  2f 


2  1/2 


'1-0  ' 


(4.7) 


where  ^  is  the  1  -  o  standard  normal  percentile  point.  The 
corresponding  bound  on  R  is  then  *  *(fi2 ^).  This  result  can 

A 

alternatively  be  obtained  by  considering  6  asymptotically  to  be  normally 
distributed  with  mean  6  and  variance  estimated  by  the  usual  propagation  of 
errors  method.  Details  are  omitted  for  the  sake  of  brevity.  It  seems  more 

A 

reasonable  to  base  inference  on  a  normal  approximation  to  6  than  on  the 

A  A 

normal  approximation  to  R  as  suggested  by  Lloyd  (1980)  since  6  is 
unbounded . 

Church  and  Harris  (1970)  point  out  that  in  certain  cases  it  is  possible 
to  assume  that  the  parameters  of  the  stress  distribution  are  effectively 
known.  This  is  equivalent  to  taking  m  +  •».  It  can  readily  be  verified  that 
£im  6  .  gives  the  solution  presented  by  Church  and  Harris. 
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4.2  The  Bayesian  Approach 


The  Bayesian  approach  also  leads  to  bounds  on  the  reliability.  Given  the 
data,  and  assuming  that  the  usual  vague  priors  are  appropriate,  vis 

P<u  .u  »«*,«*)  «  1/cV  ,  (4.8) 

*  y  *  y  *  y 

2  2 

then  the  joint  posterior  of  y  ,y  ,o  ,5  can  be  obtained.  Using  the  joint 

x  y  x  y 

posterior,  we  can  in  principle  proceed  to  find  the  marginal  posterior  of 


6  - 


y  -  V 
_ 5 _ X— 

r~i  72 

/  o  +6 


(4.9) 


x  y 

and  thus  of  R  «  4(5).  To  do  this  requires  a  triple  integration.  A  method  of 

doing  this  in  closed  from  eludes  us,  but  in  principle  we  may  do  this 

numerically.  We  would  like  to  indicate  a  simpler  solution,  and  we  first  note 
2  2 

that  given  o  ,  a  ,  that  the  posterior  of  y  is  such  that 
x  y 


2  2 

0  +0 

*  X 
n  mi 

- - ’  2  2'  ' 

/  2  2  0  +  0 

/  c  +  c  x  y 
x  y 


6  ~  Nf--  *  '.£ 


(4.10) 


while  the  posterior  of  the  variances  are  such  that 

c2  ~  (n  -  1)S2/y2  .  •  independent  of  a2  -  (m  -  1)S2/x2  *  (4.11) 

x  x  *n-1  y  y  m-1 

we  see  now  that  if  n  ■  m,  the  distributional  variance  of  6  is  simply 

1/n,  and  to  find  the  unconditional  distribution  of  5,  we  need  only 

2  2 

integrate  (4.7)  with  respect  to  the  distribution  of  o  +  o  ,  where  the 

x  y 

2  2 

distribution  of  the  variances  o  and  o  is  given  in  (4.11)  with  m  ■  n. 

*  y 

At  this  point  we  will  approximate  the  distribution  of 
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02+O2~  , 

x  y  2 


b 

* -  ♦  — S- 


(4.12) 


1Xn-1  2Xn-1 


2  2 
(the  v  „  are  independent),  where  ■  (n  -  1)8  ,  b„ 
in- 1  n  x  n 


(n  - 


letting 


2  2a 

o+o - — 

x  y  2 

Xb 


1)Sy,  by 


(4.13) 


so  that  by  equating  moments,  we  let 


where 


If  we  set 


a  ■  (e  1  +  2)c 
n  n 


b  -  (e”1  +  4)  , 
n 


en  -  (n  -  5)“1dn2  +  (1  -  dn)2 

dn  "  an/(an  +  bn) 

cn  -  (n  -  3)~\an  +  bn)  . 


2  2 

w  -  0  +0 

x  y 


then  w  has  the  approximate  distribution 


(4.14) 


(4.15) 


(4.16) 


f(w|x,X)  - 


b/2  -(f  ♦  1) 


Sb/Jr(|) 


exp(-a/2w) 


(4.17) 


Confining  (4.10)  with  (4.17)  and  integrating  out  w,  we  find  that  the 
unconditional  posterior  of  6  is  such  that 


P<«!*,X> 


exp(-a/2w)dw 


(4.18) 


or,  after  setting  u  ■  w”1^2  (|dw/du|  ■  2u-2) 
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(4.19) 


P<«|x,X> 


K  J  u^”^exp[-  ~  {n[5  -  (x  -  y)u]2  +  au2}jdu 

A  • 


Xnegrating  both  aides  of  (4.19)  will  respect  to  6  yields  (a  proof  is  given 
in  Appendix  XX) 

K  m  /n  ah/2//H  2(b-1)/2r(b/2)  (4.20) 

To  find  a  lower  (1  -  a)  Bayesian  bound  for  6,  say  6^  we  need  to 
solve,  for  6^,  the  equation 


1  -  a  -  J  p(8|x,^)d« 
*3* 


(4.21) 


which  after  an  interchange  of  the  order  of  integration,  nay  be  written  as 


b/2  •  .  ,  2  •  j-  _ 

1-a  »  “T~r -  J  u  exp{— y  }{ J  -5-  exp  -  y  [6  -  ( x-y)u]  2dfi }du  (4.22) 

2  2  r(|)  32 


so  that  we  have 


b/2  •  2 

1  -  <*  ■  : 1  U' -  J  u  exp{— [/n(  (x  -  y)u  -  » 


b— 2 

.  2  m  ( b 


(4.23) 


r(f) 


and  we  see  that  It  Is  necessary  to  solve  (4.23)  numerically  for  6^.  Once 
found  we  have  a  lower  Bayessian  bound  for  R,  viz 

*31  -  •(*„>  '  <4-24’ 

Using  (4.18)-(4.20),  it  can  be  shown  that  for  large  n. 


B(<|xfX}  *  Var(5  |x,^)  »  (1  +  , 

N  2f 


(4.25) 
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and  we  note  the  similarity  to  the  sampling  theory  results  of  (4.6).  The 
question  of  the  asymptotic  normality  of  6,  given  x,  remains  open. 

4.3  An  Example  based  on  Data  of  Kececioglu  and  Lamarre 
Kececioglu  and  Lamarre  (1978)  give  data  pertaining  to  a  mechanical 

component  (their  Example  3)  that  yields 

x  *  170,000  psi,  Sx  *  5,000  psi, 

n  -  m  ■  32 

y  *  144,500  psi,  Sy  *  8,900  psi. 

For  a  90%  confidence  lower  bound,  we  have  on  the  basis  of  the  above  data, 

Ru  -  .9822,  R2t  -  .9818,  R^  -  .9926 
These  were  of  course  calculated  as  indicated  in  Section  4.1  and  4.2.  It  is 
interesting  to  note  that  Kececioglu  and  Lamarre  obtain  the  bound  of  .980  for 
this  set  of  data,  for  1  -  a  *  .90. 

4.4  Simulation  Study  of  R^ 

The  first  part  of  our  simulation  study  compares  the  coverage  properties 

of  the  R^'s.  W®  used  2,500  iterations  with  n  «  m  *  10,  15  and  20,  with 

R  “  .99,  and  calcualted  the  R^  for  1  -  u  ■  .90.  We  then  counted  the 

number  of  times  that  R  exceeded  the  calculated  ,  with  the  proportions 

2  2 

out  of  2500  tabulated  in  Table  2.  This  procedure  was  carried  out  for  o  /a 

y  x 

set  equal  to  1  and  3.  We  note  that  the  non-Bayesian  methods  have  coverage 
very  close  to  the  nominal  confidence  level  of  .90,  even  for  very  small  sample 
sises.  We  remark  that  we  have  examined  these  bounds  for  many  cases  and  have 
observed  similar  behavior,  that  is,  the  bounds  R^  and  R^  obtained  are 
strikingly  similar,  so  that,  as  expected,  the  corresponding  estimated 
confidence  levels  are  quite  close.  This  implies  that  R^ ,  derived  as  an 
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approximation  to  ,  is  a  quite  good  approximation,  and  due  to  the  ease  of 
computation  of  Rjf'  *t8  use  *8  thus  recommended. 


Table  2.  Estimated  Confidence  levels  for  1  -  a  *  .9  (n  «  m) 
with  R  “  .99,  based  on  2,500  iterations. 


i  *  m 

R1l 

R2£ 

R3t 

10 

.908 

.897 

.956 

15 

.903 

.894 

.940 

20 

.912 

.899 

.939 

10 

.899 

.888 

.940 

15 

.899 

.889 

.928 

20 

.894 

.889 

.921 

The  coverage  of  the  Bayesian  method  is  farther  from  the  nominal  than  the 
confidence  methods  but  this  improves  with  larger  sample  size  corresponding  to 
the  results  mentioned  at  the  end  of  Section  4. 

The  second  part  of  our  simulation  study  examines  and  for 

unequal  sample  sizes  and  is  presented  in  Table  3.  Again  we  see  that  the 
coverage  is  quite  close  to  the  nominal  confidence  level  of  .90  and  that  the 
simple  R2t  method  performs  satisfactorily.  Since  for  equal  sample  sizes 
just  f  needs  to  be  estimated  while  in  the  unequal  case  both  f  and  N  are 
estimated  we  had  expected  our  methods  to  perform  not  as  well  in  the  unequal 
case  as  the  equal  case.  Comparing  Tables  1  and  3  we  see  that  this  is 
Incorrect  and  that  in  fact  there  is  no  deterioration  for  unequal  sample  sizes. 
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Table  3.  Estimated  Confidence  levels  for  1  -  o  =  .9 


(n  *= 

m),  R  - 

.99,  based 

on  2,500 

iterations 

2  2 
cr  /o 

(n,ra) 

10,15 

10,20 

15,10 

15,20 

20,10 

20,15 

y  x 

i 

R1f 

.900 

.9  00 

.900 

.901 

.901 

.904 

R2* 

.894 

.892 

.889 

.891 

.890 

.899 

R1i 

.907 

.911 

.892 

.901 

.885 

.892 

3 

R2t 

.898 

.901 

.882 

.891 

.875 

.884 

5.  DISCUSSION 

The  above  computations  of  lower  confidence  bounds  can  easily  be 
extended  to  the  case  where  strength  and  stress  are  sums  of  independent 
normal  variates,  each  of  which  can  be  sampled  separately.  In  addition  if  X 
and  Y  are  lognormally  distributed  with  X,Y  >  0,  and  since  P(Y  <  X)  = 

P(log  Y  <  log  X),  all  the  results  of  this  paper  can  be  applied  after  taking 
the  log  transformation  of  the  observations. 

In  many  situations  X  and  Y  are  not  observed  directly  but  one  or  both 
of  them  are  computed  as  functions  of  random  variables  which  can  be  observed 
(see  Haugen  (1980)  for  example).  Frequently  these  functions  are 
multiplicative  in  nature  (Avakov  (1983))  and  the  use  of  the  lognormal  is 
reasonable.  If  these  multiplicative  factors  can  be  assumed  to  be  lognormally 
distributed,  then  with  the  availability  of  sampling  data  confidence  bounds  can 
be  calculated  in  addition  to  the  point  estimates  of  Avakov. 

Although  the  normal  distribution  is  most  commonly  assumed  for  both 
strenth  and  stress  there  are  often  theoretical  or  empirical  justifications  for 
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the  use  of  other  distributions.  Distributions  such  as  the  lognormal, 
exponential.  Gamma ,  weiball,  extreme  value,  and  Maxwellian  have  been  suggested 
(Haugen  (1980),  Dhillon  (I960)).  Note  that  the  strength  and  stress 
distributions  may  come  frost  a  different  family.  Although  there  has  been  some 
discussion  in  the  literature  on  point  estimation  in  the  non-normal  case  (see 
for  example  Beg  (1980)  and  the  references  cited  there)  very  few  results  are 
available  on  interval  estimation.  Basu  (1981)  gives  some  results  in  the  Gamma 
case  and  we  have  shown  above  how  the  lognormal  situation  can  be  handled. 
Unfortunately,  inference  will  be  highly  sensitive  to  parametric  assumptions 
and  where  only  small  sample  sizes  are  available  it  will  be  very  difficult  to 
decide  which  parametric  form  is  appropriate.  The  nonparametric  approach  is 
not  very  helpful  here  due  to  its  extremely  conservative  nature  (Basu 
(1981)).  There  seems  to  be  a  need  for  some  semi-nonpar ametric  approach  which 
would  not  apply  to  all  distributions  but  just  to  some  set  of  "reasonable" 
ones. 
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APPENDIX  AI 

As  mentioned  in  the  Introduction  section,  we  state  in  this  Appendix,  the 

A 

formula  for  R2»  the  uniformly  minimum  variance  unbiased  estimator  (DMVUE) 

2 

of  R  *  P( X  <  Y),  as  derived  by  Downton  (1973),  where  X  ~  N(i»x#Ox), 

2 

Y  ~  N(y  ,0  ),  with  X  and  Y  independent.  We  are  assuming  that  samples 

y  y 

X1#...,Xn  of  n  independent  observations  on  X  and,  independently, 
Y1,...,Ym,  m  independent  observations  on  Y,  are  available.  Using  the 

A 

notation  of  our  paper,  Downton  shows  that  R2  is  given  as  follows:  Define 


S  (n 
x 


-  1  )n 


-1/2 


S  (m 
y 


-Dm 


-1/2 


(AI.l) 


and  for  |v|  <  1,  let 


h(v)  »  ■{ 


0  if  *(v)  <  -1,  for  all  |v|  <  1 
min[<Mv,1)  , 

J  (1  -  u2)<m"4)/2du,  if  ♦(v)  >  -1 

-1 


(AI.2) 


for  some  v  e  (-1,1)  , 


where 


Then, 


-  S  (n  -  1)  /- 

(x  -  y)/m  _x _  / m 

S  (m  -  1)  S  (m  -  1)  /  n 


♦  (v) 


(AI.2a) 


if 


A  >  1 


r(s-§-J-)r(! 


m  -  1 
2 


)  i 

j  h(v)(1 


n-4 

v2)  2  dv  if  Ia|  <  1 


(AI.3) 


2 

0  if 


A  <  1 


(We  note  that  on  page  557  of  Downton' s  aforementioned  paper,  the  last  two 

A  A 

lines  have  an  error  and  should  have  m  0"  and  "R^  ”  1"  interchanged). 
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APPENDIX  All 


As  advertised  in  Section  3,  we  wish  to  derive  in  this  Appendix,  the 
result  given  in  (4.20)*  We  have  from  (4.19)  that 


1  ■  J  p(6|x,£)dfi  - 


(All . 1 ) 


K  J  J  u^^expj-  —  {n[4  -  (x  -  y)u]2  +  au2]dudw 
0  2 


where  a  >  0,  and  b  >  4.  On  interchanging  the  order  of  integration,  we  have 


1  *  K  J  u^^expj-  ^  au2}{  J  exp  -  [6  -  (x  -  y)u]2d6}du  (All. 2) 

0  2  —  2 


The  inner  integral  of  (All. 2)  is  that  of  an  unnormalized  normal  density,  mean 
(x  -  y)u,  variance  1/n,  so  that  the  inner  integral  has  value  /2x/n. 

Hence 

1  ■  K  J  u^^expf-  ~  au2}du  (All. 3) 

/n  0 

2 

If  we  let  au  /2  *  w  in  the  integral,  we  easily  find 


/57  ,!«'/»  -  2  -  ’  - 

'  ■  ~F  K  — wT~  l  *  •  *» 

✓n  a  0 


(All. 4) 


so  that 


,  slb-”/2  /Tr(|) 

1  -  — 


Z^2 


(All. 4a) 


or 


as  stated. 


(All. 5) 
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